[SPARK-22883][ML][TEST] Streaming tests for spark.ml.feature, from A to H#20111
[SPARK-22883][ML][TEST] Streaming tests for spark.ml.feature, from A to H#20111jkbradley wants to merge 2 commits intoapache:masterfrom
Conversation
There was a problem hiding this comment.
No existing test to use
There was a problem hiding this comment.
Moved from object to class b/c this needed testTransformer from the MLTest mix-in
There was a problem hiding this comment.
No existing unit test to use
|
Test build #85494 has finished for PR 20111 at commit
|
There was a problem hiding this comment.
Rearranged this test so it checks each row independently.
There was a problem hiding this comment.
ditto: rearranged to do validity check per-row
There was a problem hiding this comment.
So here do not need to select "keys" column ?
There was a problem hiding this comment.
I don't think we have to. The main thing here is to make sure that the transform really does happen. Other tests check validity of the values.
There was a problem hiding this comment.
OK. so I prefer to use simpler code:
testTransformer[Tuple1[Vector]](dataset.toDF(), brpModel, "values") {
case Row(values: Seq[_]) =>
...
|
LGTM except a tiny issue. :) |
|
Test build #85789 has finished for PR 20111 at commit
|
12b3dcf to
448668d
Compare
|
Updated! Thanks @WeichenXu123 -- I'll merge this once tests pass. |
|
Test build #87857 has finished for PR 20111 at commit
|
|
Merging with master and branch-2.3 |
…to H ## What changes were proposed in this pull request? Adds structured streaming tests using testTransformer for these suites: * BinarizerSuite * BucketedRandomProjectionLSHSuite * BucketizerSuite * ChiSqSelectorSuite * CountVectorizerSuite * DCTSuite.scala * ElementwiseProductSuite * FeatureHasherSuite * HashingTFSuite ## How was this patch tested? It tests itself because it is a bunch of tests! Author: Joseph K. Bradley <joseph@databricks.com> Closes #20111 from jkbradley/SPARK-22883-streaming-featureAM. (cherry picked from commit 119f6a0) Signed-off-by: Joseph K. Bradley <joseph@databricks.com>
…to H ## What changes were proposed in this pull request? Adds structured streaming tests using testTransformer for these suites: * BinarizerSuite * BucketedRandomProjectionLSHSuite * BucketizerSuite * ChiSqSelectorSuite * CountVectorizerSuite * DCTSuite.scala * ElementwiseProductSuite * FeatureHasherSuite * HashingTFSuite ## How was this patch tested? It tests itself because it is a bunch of tests! Author: Joseph K. Bradley <joseph@databricks.com> Closes apache#20111 from jkbradley/SPARK-22883-streaming-featureAM. (cherry picked from commit 119f6a0) Signed-off-by: Joseph K. Bradley <joseph@databricks.com>
What changes were proposed in this pull request?
Adds structured streaming tests using testTransformer for these suites:
How was this patch tested?
It tests itself because it is a bunch of tests!